Abstract. We present a packing-based approximation algorithm for the k-Set
Cover problem. We introduce a new local search-based k-set packing heuristic,
and call it Restricted k-Set Packing. We analyze its tight approximation ratio via a
complicated combinatorial argument. Equipped with the Restricted k-Set Packing
algorithm, our k-Set Cover algorithm is composed of the k-Set Packing heuristic
[7] for $k \ge 7$, Restricted k-Set Packing for $k = 6, 5, 4$ and the semi-local
(2,1)-improvement [2] for 3-Set Cover. We show that our algorithm obtains a tight
approximation ratio of $H_k - 0.6402 + \Theta(\frac{1}{k})$, where $H_k$ is the k-th harmonic
number. For small $k$, our results are 1.8667 for $k = 6$, 1.7333 for $k = 5$ and
1.5208 for $k = 4$. Our algorithm improves the currently best approximation ratio
for the k-Set Cover problem of any $k \ge 4$.



1 Introduction 

Given a set of elements $U$ and a collection $\mathcal{S}$ of subsets of $U$, with each subset in
$\mathcal{S}$ having size at most $k$ and the union of $\mathcal{S}$ being $U$, the k-Set Cover problem is
to find a minimum-size sub-collection of $\mathcal{S}$ whose union remains $U$. Without loss of
generality, we assume that $\mathcal{S}$ is closed under subsets. Then the objective of the k-Set
Cover problem can be viewed as finding a disjoint union of sets of $\mathcal{S}$ which covers $U$.

The k-Set Cover problem is NP-hard for any $k \ge 3$. For $k = 2$, the 2-Set Cover
problem is polynomial-time solvable by a maximum matching algorithm. The greedy
approach for approximating the k-Set Cover problem chooses a maximal collection of
i-sets (sets with size $i$) for each $i$ from $k$ down to 1. It achieves a tight approximation
ratio $H_k$ (the k-th harmonic number) [9]. The hardness result by Feige [3] shows that for
$n = |U|$, the Set Cover problem is not approximable within $(1 - \epsilon) \ln n$ for any $\epsilon > 0$
unless NP $\subseteq$ DTIME($n^{\log \log n}$). For the k-Set Cover problem, Trevisan [12] shows that no
polynomial-time algorithm has an approximation ratio better than $\ln k - \Omega(\ln \ln k)$
unless subexponential-time deterministic algorithms for NP-hard problems exist.
Therefore, it is unlikely that a tremendous improvement of the approximation ratio is possible.
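
The phase-wise greedy approach described above can be sketched as follows. This is a minimal illustration rather than the implementation analyzed in [9]: the function names `harmonic` and `greedy_cover` and the toy instance are assumptions made for this example, and the input collection is assumed to be closed under subsets, so that keeping only the uncovered part of a set is legitimate.

```python
from fractions import Fraction


def harmonic(k):
    """Exact k-th harmonic number H_k = 1 + 1/2 + ... + 1/k."""
    return sum(Fraction(1, i) for i in range(1, k + 1))


def greedy_cover(sets, k):
    """Phase-wise greedy cover; returns disjoint pieces covering the union.

    In phase i (i = k, ..., 1) a set is taken whenever at least i of its
    elements are still uncovered. Only the uncovered part is kept, which
    relies on the assumption that the collection is closed under subsets.
    """
    covered, pieces = set(), []
    for i in range(k, 0, -1):
        for s in sets:
            fresh = set(s) - covered
            if len(fresh) >= i:
                pieces.append(fresh)
                covered |= fresh
    return pieces


# Hypothetical toy instance with k = 3.
sets = [{1, 2, 3}, {3, 4, 5}, {5, 6}, {1, 6}, {2, 4}]
pieces = greedy_cover(sets, k=3)
```

On this toy instance the greedy cover uses one 3-set, one 2-set and one 1-set. The bound $H_k$ is a worst-case guarantee, so individual instances can of course be covered much closer to the optimum.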

There is no evidence that the $\ln k - \Omega(\ln \ln k)$ lower bound can be achieved.
Research on approximating the k-Set Cover problem has been focused on improving the
positive constant $c$ in the approximation ratio $H_k - c$. Small improvements on the
constant might lead us closer to the optimal ratio. One of the main ideas based on greedy
algorithms is to handle small sets separately. Goldschmidt et al. [4] give a heuristic
using a matching computation to deal with sets of size 2 and obtain an $H_k - \frac{1}{6}$
approximation ratio. Halldórsson [5] improves $c$ to $\frac{1}{3}$ via his "t-change" and "augmenting
path" techniques. Duh and Fürer [2] give a semi-local search algorithm for the 3-Set
Cover problem and further improve $c$ to $\frac{1}{2}$. They also present a tight example for their
semi-local search algorithm.

A different idea is to replace the greedy approach by a set-packing approach. Levin
[10] uses a set-packing algorithm for packing 4-sets and improves $c$ to 0.5026 for $k \ge 4$.
Athanassopoulos et al. [1] substitute the greedy phases for $k \ge 6$ with packing phases
and reach an approximation ratio $H_k - 0.5902$ for $k \ge 6$.

The goal of this paper is not to provide an incremental improvement in the approximation
ratio for k-Set Cover. We rather want to obtain the best such result achievable by
current methods. It might be the best possible result, as we conjecture the lower bound
presented in [12] not to be optimal.

In this paper, we give a complete packing-based approximation algorithm (in short,
PRPSLI) for the k-Set Cover problem. For $k \ge 7$, we use the k-set packing heuristic
introduced by Hurkens and Schrijver [7], which achieves the best approximation ratio
known to date, $\frac{2}{k} - \epsilon$ for any $\epsilon > 0$, for the k-Set Packing problem. On the other hand,
the best hardness result by Hazan et al. [6] shows that it is NP-hard to approximate the
k-Set Packing problem within $\Omega(\frac{\ln k}{k})$.

For $k = 6, 5, 4$, we use the same packing heuristic with the restriction that any local
improvement should not increase the number of 1-sets which are needed to finish the
disjoint set cover. We call this new heuristic Restricted k-Set Packing. We prove that
for any $k \ge 5$, the Restricted k-Set Packing algorithm achieves the same approximation
ratio as the corresponding unrestricted set packing heuristic. For $k = 4$, this is not the
case. The approximation ratio of the Restricted 4-Set Packing algorithm is $\frac{7}{16}$, which
is worse than the $\frac{1}{2} - \epsilon$ ratio of the 4-set packing heuristic, but it is also tight. For
$k = 3$, we use the semi-local optimization technique [2]. We thereby obtain the currently
best approximation ratio for the k-Set Cover problem. Table 1 (in Appendix Section
5) includes a comparison of the approximation ratio of our algorithm with GRSLI$_{k,5}$
[2], Levin's algorithm [10] and PRSLI$_{k,5}$ [1]. We also show that our result is indeed
tight. Thus, k-Set Cover algorithms which are based on packing heuristics can hardly
be improved. Our novel Restricted k-Set Packing algorithm is quite simple and natural,
but its analysis is complicated. It is essentially based on combinatorial arguments. We
use the factor-revealing linear programming analysis for the k-Set Cover problem. The
factor-revealing linear program was introduced by Jain et al. [8] for analyzing the facility
location problem. Athanassopoulos et al. [1] were the first to apply it to the k-Set Cover
problem.

The paper is organized as follows. In Section 2, we give the description of our
algorithm and present the main results. In Section 3, we prove the approximation ratio
of the Restricted k-Set Packing algorithm. In Section 4, we analyze our k-Set Cover
algorithm via the factor-revealing linear program.

2 Algorithm Description and the Main Theorem

In this section, we describe our packing-based k-Set Cover approximation algorithm.
We first give an overview of some existing results.

Duh and Fürer [2] introduce a semi-local $(s, t)$-improvement for the 3-Set Cover
problem. First, it greedily selects a maximal disjoint union of 3-sets. Then each local
improvement replaces $t$ 3-sets with $s$ 3-sets, if and only if after computing a maximum
matching of the remaining elements, either the total number of sets in the cover
decreases, or it remains the same while the number of 1-sets decreases. They also show
that the (2,1)-improvement algorithm gives the best performance ratio for the 3-Set
Cover problem among all semi-local $(s, t)$-improvement algorithms. The ratio is proved
to be tight.

Theorem 1 ([2]). The semi-local (2,1)-optimization algorithm for 3-Set Cover
produces a solution with performance ratio $\frac{4}{3}$. It uses a minimal number of 1-sets.

We use the semi-local (2,1)-improvement as the basis of our k-Set Cover algorithm.
Other phases of the algorithm are based on the set packing heuristic [7]. For fixed $s$, the
heuristic starts with an arbitrary maximal packing and replaces $p \le s$ sets in the packing
with $p + 1$ sets whenever the resulting collection is still a packing. Hurkens and Schrijver [7]
show the following result.

Theorem 2 ([7]). For all $\epsilon > 0$, the local search k-Set Packing algorithm for
parameter $s = O(\log_k \frac{1}{\epsilon})$ has an approximation ratio $\frac{2}{k} - \epsilon$.
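
The local search described above can be sketched by brute force. This is an illustrative sketch under stated assumptions, not the procedure of [7]: the names `is_packing`, `find_improvement` and `local_search_packing` and the toy instance are hypothetical, candidate sets are assumed to be distinct, and the exhaustive enumeration is only practical for very small inputs and constant $s$.

```python
from itertools import combinations


def is_packing(sets):
    """True if the given sets are pairwise disjoint."""
    seen = set()
    for s in sets:
        if seen & s:
            return False
        seen |= s
    return True


def find_improvement(packing, candidates, s):
    """Try to swap p <= s packed sets for p + 1 candidate sets."""
    pool = [c for c in candidates if c not in packing]
    for p in range(s + 1):
        for removed in combinations(packing, p):
            rest = [x for x in packing if x not in removed]
            for added in combinations(pool, p + 1):
                trial = rest + list(added)
                if is_packing(trial):
                    return trial
    return None


def local_search_packing(candidates, s):
    """Start from a greedy maximal packing and improve until stuck."""
    candidates = [frozenset(c) for c in candidates]
    packing = []
    for c in candidates:
        if is_packing(packing + [c]):
            packing.append(c)
    while True:
        better = find_improvement(packing, candidates, s)
        if better is None:
            return packing
        packing = better


# Hypothetical toy instance: one "bad" 3-set blocks three disjoint 3-sets.
cands = [{2, 5, 8}, {1, 2, 3}, {4, 5, 6}, {7, 8, 9}]
stuck = local_search_packing(cands, s=0)
improved = local_search_packing(cands, s=1)
```

Each accepted swap increases the size of the packing by one, so the loop terminates after at most as many rounds as there are candidate sets.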

The worst-case ratio is also known to be tight. We apply this packing heuristic for
$k \ge 7$. For $k = 6, 5, 4$, we follow the intuition of the semi-local improvement and
modify the local search of the packing heuristic, requiring that any improvement does
not increase the number of 1-sets. We use the semi-local (2,1)-improvement for 3-Set
Cover to compute the number of 1-sets required to finish the cover. Lemma 2.2 in [2]
guarantees that the number of 1-sets returned by the semi-local (2,1)-improvement is
no more than this number in any optimal solution. We compute this number once at the
beginning of the restricted phase. Each time we want to make a replacement via the
packing heuristic, we compute the number of 1-sets needed to finish the cover after
making the replacement. If this number increases, the replacement is prohibited. To
summarize, we call our algorithm the Restricted Packing-based k-Set Cover algorithm
(PRPSLI) and give the pseudo-code in Algorithm 1. For input parameter $\epsilon > 0$, $s_i$ is
the parameter of the local improvement in Phase $i$. For any $i \ne 5, 6$, we set $s_i$ in the
same way as in Theorem 2. For $i = 5, 6$, we set $s_i = \lceil \cdots \rceil$.
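
The commit rule of the restricted phase can be sketched as a guard around a single replacement. This is only an illustration: `try_replacement` is a hypothetical helper, and `count_one_sets` is an assumed callable standing in for the number of 1-sets returned by the semi-local (2,1)-improvement on the uncovered elements. The stub `toy_one_sets` used in the example is a deliberately simplified stand-in, not the algorithm of [2].

```python
def try_replacement(packing, removed, added, count_one_sets):
    """Return the new packing if the swap is allowed, else the old packing.

    The swap is rejected if the result is not a packing, or if the number
    of 1-sets reported by the (assumed) oracle `count_one_sets` increases.
    """
    candidate = [s for s in packing if s not in removed] + list(added)
    covered = set()
    for s in candidate:
        if covered & s:
            return packing          # not a packing
        covered |= s
    if count_one_sets(candidate) > count_one_sets(packing):
        return packing              # blocked: more 1-sets would be needed
    return candidate


# Toy stand-in oracle (assumption): elements 4 and 9 can only be finished
# together, so covering exactly one of them forces a 1-set for the other.
PAIRS = [(4, 9)]


def toy_one_sets(packing):
    covered = set().union(*packing)
    return sum((a in covered) != (b in covered) for a, b in PAIRS)


packing = [{1, 2, 3}]
blocked = try_replacement(packing, [{1, 2, 3}], [{1, 2, 4}, {3, 5, 6}], toy_one_sets)
allowed = try_replacement(packing, [{1, 2, 3}], [{1, 2, 7}, {3, 5, 6}], toy_one_sets)
```

Both swaps are valid in unrestricted packing; only the second one survives the restriction in this toy model.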

The algorithm clearly runs in polynomial time. The approximation ratio of PRPSLI
is presented in the following main theorem. For completeness, we also state the
approximation ratio for the 3-Set Cover problem, which was obtained by Duh and Fürer [2] and
remains the best result. Let $\rho_k$ be the approximation ratio of the k-Set Cover problem.

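Theorem 3 (Main). For all $\epsilon > 0$, the Packing-based k-Set Cover algorithm has an
approximation ratio $\rho_k = 2H_k - H_{\frac{k}{2}} + \frac{2}{k} - \frac{1}{k-1} - \frac{4}{3} + \epsilon$ for even $k$ and $k \ge 6$;
$\rho_k = 2H_k - H_{\frac{k-1}{2}} - \frac{4}{3} + \epsilon$ for odd $k$ and $k \ge 7$; $\rho_5 = 1.7333$; $\rho_4 = 1.5208$; $\rho_3 = \frac{4}{3}$.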

Algorithm 1 Packing-based k-Set Cover Algorithm (PRPSLI)

  // The k-Set Packing Phase
  for i ← k down to 7 do
    Select a maximal collection of disjoint i-sets.
    repeat
      Select p ≤ s_i i-sets and replace them with p + 1 i-sets.
    until there exist no more such improvements.
  end for

  // The Restricted k-Set Packing Phase
  Run the semi-local (2,1)-improvement algorithm for 3-Set Cover on the remaining
  uncovered elements to obtain the number of 1-sets.
  for i ← 6 down to 4 do
    repeat
      Try to replace p ≤ s_i i-sets with p + 1 i-sets. Commit to the replacement only if
      the number of 1-sets computed by the semi-local (2,1)-improvement algorithm for
      3-Set Cover on the remaining uncovered elements does not increase.
    until there exist no more such improvements.
  end for

  // The Semi-Local Optimization Phase
  Run the semi-local (2,1)-improvement algorithm on the remaining uncovered elements.



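Remark 1. For odd $k \ge 7$, the approximation ratio $\rho_k$ is derived from the expression
$\rho_k = \frac{2}{k} + \frac{2}{k-2} + \cdots + \frac{2}{5} + \frac{1}{3} + 1 + \epsilon$. We can further obtain the asymptotic
representation of $\rho_k$, i.e.,
$\rho_k = 2H_k - H_{\frac{k-1}{2}} - \frac{4}{3} + \epsilon = H_k + \ln 2 - \frac{4}{3} + \Theta(\frac{1}{k}) + \epsilon = H_k - 0.6402 + \Theta(\frac{1}{k}) + \epsilon$.
Similarly, for even $k \ge 6$,
$\rho_k = \frac{2}{k} + \frac{1}{k-1} + \frac{2}{k-3} + \cdots + \frac{2}{5} + \frac{1}{3} + 1 + \epsilon = 2H_k - H_{\frac{k}{2}} + \frac{2}{k} - \frac{1}{k-1} - \frac{4}{3} + \epsilon$
$= H_k + \ln 2 - \frac{4}{3} + \Theta(\frac{1}{k}) + \epsilon = H_k - 0.6402 + \Theta(\frac{1}{k}) + \epsilon$.
Finally, $\rho_5 = \frac{2}{5} + \frac{1}{3} + 1$ and $\rho_4 = \frac{7}{16} + \frac{1}{12} + 1$.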
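
The agreement between the closed form of Theorem 3 and the sum expressions of Remark 1 can be checked numerically. The sketch below drops the additive $\epsilon$ and uses exact rational arithmetic; the function names `H`, `rho_closed` and `rho_sum` are assumptions made for this example.

```python
from fractions import Fraction
from math import log


def H(n):
    """Exact n-th harmonic number."""
    return sum(Fraction(1, i) for i in range(1, n + 1))


def rho_closed(k):
    """Closed form of Theorem 3 for k >= 6, without the epsilon term."""
    if k % 2 == 0:
        return (2 * H(k) - H(k // 2) + Fraction(2, k)
                - Fraction(1, k - 1) - Fraction(4, 3))
    return 2 * H(k) - H((k - 1) // 2) - Fraction(4, 3)


def rho_sum(k):
    """Sum expression of Remark 1 for k >= 5, without the epsilon term."""
    tail = Fraction(1, 3) + 1
    if k % 2 == 1:
        # 2/k + 2/(k-2) + ... + 2/5 + 1/3 + 1
        return sum(Fraction(2, i) for i in range(5, k + 1, 2)) + tail
    # 2/k + 1/(k-1) + 2/(k-3) + ... + 2/5 + 1/3 + 1
    return (Fraction(2, k) + Fraction(1, k - 1)
            + sum(Fraction(2, i) for i in range(5, k - 2, 2)) + tail)


# Gap between rho_k and H_k for a large k, compared with ln 2 - 4/3.
gap = float(rho_closed(1000) - H(1000))
limit = log(2) - 4 / 3
```

For $k = 6, 7, 8$ both forms evaluate to 1.8667, 2.0190 and 2.1262 after rounding to four decimals, and the gap $\rho_k - H_k$ approaches $\ln 2 - \frac{4}{3} \approx -0.6402$ as $k$ grows.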

Remark 2. The restriction on Phase 6 is only required for obtaining the approximation ratio
$\rho_k$ for even $k$ and $k \le 12$. In other cases, only the restrictions on Phase 5 and Phase 4 are
necessary.

We prove the main theorem in Section 4. Before that, we analyze the approximation
ratio of the Restricted k-Set Packing algorithm for $k \ge 4$ in Section 3. We state the
approximation ratio of the Restricted k-Set Packing algorithm as follows.

Theorem 4 (Restricted k-Set Packing). There exists a Restricted 4-Set Packing
algorithm which has an approximation ratio $\frac{7}{16}$. For all $\epsilon > 0$ and for any $k \ge 5$, there
exists a Restricted k-Set Packing algorithm which has an approximation ratio $\frac{2}{k} - \epsilon$.

Remark 3. Without loss of generality, we assume that the optimal solution of the Restricted
k-Set Packing problem also has the property that it does not increase the number of
1-sets needed to finish the cover of the remaining uncovered elements. This assumption
is justified by Lemma 2 in Appendix 9.1.

3 The Restricted k-Set Packing Algorithm

We fix one optimal solution $\mathcal{O}$ of the Restricted k-Set Packing problem. We refer to
the sets in $\mathcal{O}$ as optimal sets. For fixed $s$, a local improvement replaces $p \le s$ k-sets with
$p + 1$ k-sets. We pick a packing of k-sets $\mathcal{A}$ that cannot be improved by the Restricted
k-Set Packing algorithm. We say an optimal set is an i-level set if exactly $i$ of its elements
are covered by sets in $\mathcal{A}$. For the sake of analysis, we call a local improvement an
i-j-improvement if it replaces $i$ sets in $\mathcal{A}$ with $j$ sets in $\mathcal{O}$. As a convention in the
rest of the paper, small letters represent elements, capital letters represent subsets of
$U$, and calligraphic letters represent collections of sets. We first introduce the notion of
blocking.

3.1 Blocking 

The main difference between unrestricted k-set packing and restricted k-set packing is
the restriction on the number of 1-sets which are needed to finish the covering via the
semi-local (2,1)-improvement. This restriction can prohibit a local improvement. If any
i-j-improvement is prohibited because of an increase of 1-sets, we say there exists a
blocking. In Example 1 given in Appendix Section 6.1, we construct an instance of
4-set packing to help explain how blocking works.

We now define blocking formally. We are given a fixed optimal k-set packing $\mathcal{O}$
of $U$ and a k-set packing $\mathcal{A}$ chosen by the Restricted k-Set Packing algorithm. We
consider all possible extensions of $\mathcal{A}$ to a disjoint cover of $U$ by 1-sets, 2-sets and
3-sets. We order these extensions lexicographically, first by the number of 1-sets, second
by the total number of 2-sets and 3-sets which are not within a k-set of $\mathcal{O}$, and third
by the number of 3-sets which are not within a k-set of $\mathcal{O}$. We are interested in the
lexicographically first extension. Notice that we pick this specific extension for analysis
only. We cannot obtain this ordering without access to $\mathcal{O}$. We explain how we order the
extensions in Example 2 (Appendix Section 6.2).
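
The three-level ordering above is an ordinary lexicographic comparison of triples. In the sketch below each extension is summarized by a hypothetical triple (number of 1-sets, number of 2-sets and 3-sets not within an optimal set, number of 3-sets not within an optimal set); the sample values are illustrative and are not tied to a particular instance.

```python
# Each extension is summarized by the triple (n1, n2, n3):
#   n1 = number of 1-sets,
#   n2 = number of 2-sets and 3-sets not within one optimal set,
#   n3 = number of 3-sets not within one optimal set.
# Python compares tuples lexicographically, which matches the ordering.
extensions = [(2, 1, 0), (2, 0, 0), (0, 2, 1), (0, 2, 0)]

first = min(extensions)          # the lexicographically first extension
ranked = sorted(extensions)      # all extensions, best first
```

The number of 1-sets dominates the comparison, so an extension with fewer 1-sets comes first even if it uses more 2-sets and 3-sets that cross optimal sets.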

Suppose we finish the cover from the packing $\mathcal{A}$ with the lexicographically first
extension. Let $\mathcal{F}$ be an undirected graph such that each vertex in $\mathcal{F}$ represents an
optimal set. Two vertices are adjacent if and only if there is a 2-set in the extension
intersecting both corresponding optimal sets. Since the number of 2-sets and 3-sets
not within an optimal set is minimized, there are no multiple edges in the graph. For
brevity, when we talk about a node $V$ in $\mathcal{F}$, we also refer to $V$ as the corresponding
optimal set. Moreover, when we say the degree of a node $V$, we refer to the number of
neighbors of $V$.
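
The graph defined above can be built and tested for cycles with a few lines of code. This is a sketch on a hypothetical toy instance: the helpers `crossing_graph` and `is_forest` are names introduced here, and the acyclicity test is a standard union-find check.

```python
def crossing_graph(optimal_sets, two_sets):
    """Edges {i, j} for every 2-set that meets optimal sets i and j."""
    edges = set()
    for t in two_sets:
        touched = [i for i, o in enumerate(optimal_sets) if o & t]
        if len(touched) == 2:
            edges.add(tuple(touched))
    return edges


def is_forest(n, edges):
    """Union-find test: a graph on n vertices is a forest iff no edge
    joins two vertices that are already connected."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru == rv:
            return False
        parent[ru] = rv
    return True


# Hypothetical optimal 4-sets and crossing 2-sets of an extension.
optimal = [{1, 2, 3, 4}, {5, 6, 7, 8}, {9, 10, 11, 12}]
path_edges = crossing_graph(optimal, [{4, 5}, {8, 9}])
cycle_edges = crossing_graph(optimal, [{4, 5}, {8, 9}, {12, 1}])
```

The first extension yields a path on three nodes. The second one closes a cycle, which cannot occur in a lexicographically first extension.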

Proposition 1. $\mathcal{F}$ is a forest.

Proposition 2. For any $i < k - 1$, there is no 1-set inside an i-level set, i.e., a 1-set can
only appear in $(k-1)$-level sets.

Proposition 3. For any tree $\mathcal{T}$ in $\mathcal{F}$, there is at most one node which represents an
i-level set, such that the degree of the node is smaller than $k - i$.






Martin Fürer and Huiwen Yu



For any tree, if there exists a node with the property in Proposition 3, we define it to be
the root. Otherwise, we know that all degree 1 nodes represent $(k-1)$-level sets. We
define an arbitrary node not representing a $(k-1)$-level set to be the root. If there are
only $(k-1)$-level sets in the tree, i.e., the tree degenerates to one edge or a single point,
we define an arbitrary $(k-1)$-level set to be the root. All leaves represent $(k-1)$-level
sets. (The root is not considered to be a leaf.) We call such a tree a blocking tree. For
any subtree, we say that the leaves block the nodes in this subtree. We also call the set
represented by a leaf a blocking set.

We consider one further property of the root. 

Proposition 4. Let $k \ge 4$. In any blocking tree, there exists at most one node of either
0-level or 1-level that is of degree 2. If such a node exists, it is the root.

The proofs of Propositions 1 to 4 are given in Appendix Section 6.3.
Based on these simple structures of the blocking tree, we are now ready to prove
the approximation ratio of the Restricted k-Set Packing algorithm.

3.2 Analysis of the Restricted 4-Set Packing Algorithm 

We prove in this section that the Restricted 4-Set Packing algorithm has an approximation
ratio $\frac{7}{16}$. We first explain how this $\frac{7}{16}$ ratio is derived. We use the unit U defined in
Example 1 (Appendix Section 6.1). Assume when the algorithm stops, we have $n \gg 1$
copies of U and a relatively small number of 3-level sets. We denote the i-th copy of U
by $U_i$. For each $i$ and $1 \le j \le 12$, the sets $O_j$ in $U_i$ and $U_{i+1}$ are adjacent. This chain
of $O_j$'s starts from and ends at a 3-level set, respectively. Then the performance ratio of
this instance is slightly larger than $\frac{7}{16}$. We first prove that the approximation ratio of the
Restricted 4-Set Packing algorithm is at least $\frac{7}{16}$.

We are given a forest of blocking trees $\mathcal{F}$. We assign 4 tokens to every element
covered by sets chosen by the restricted packing algorithm. We say a set has a free token
if after distributing the token, this set retains at least 7 tokens. We show that we can
always distribute the tokens among all the optimal sets $\mathcal{O}$, so that there are at least 7
tokens in each optimal set.

Proof. We present the first round of redistribution.

Round 1 - Redistribution in each blocking tree $\mathcal{T}$. Every leaf in $\mathcal{T}$ has 4 free
tokens to distribute. Every internal node $V$ of degree $d$ requests $4(d - 2)$ tokens from
leaves. We consider each node with nonzero request in the reverse order given by breadth
first search (BFS).

- If $d = 3$, $V$ requests 4 tokens from any leaf in the subtree rooted at $V$ which has 12
tokens.

- If $d = 4$, $V$ has three children $V_1, V_2, V_3$. $V$ sends requests of 4 tokens to any leaf
in the subtrees rooted at $V_1, V_2, V_3$, one for each subtree. $V$ takes any two donations
of 4 tokens.

- The root of degree $r$ receives the rest of the tokens contributed by the leaves.

Proposition 5. After Round 1, every internal node in $\mathcal{F}$ has at least 8 tokens, and the root
of degree $r$ has $4r$ tokens.

Proposition 5 is proved in Appendix Section 7.1. According to Proposition 4, we
know that after the first round of redistribution every node has at least 8 tokens except
any 0-level roots which are of degree 1 and any singletons in $\mathcal{F}$ which are 1-level sets.

We first consider the collection $\mathcal{S}_1$ of 1-level sets which are singletons in $\mathcal{F}$. Let
$S_1$ be such a 1-level set that intersects with a 4-set $A$ chosen by the algorithm. Assume
$A$ also intersects with $j$ other optimal sets $\{O_i\}_{i=1}^{j}$.

We point out that no $O_i$ belongs to $\mathcal{S}_1$. Otherwise suppose $O_i \in \mathcal{S}_1$. Then there is
a 1-2-improvement (replace $A$ with $S_1$ and $O_i$).

We give the second round of redistribution, such that after this round, every set in
$\mathcal{S}_1$ has at least 7 tokens. For optimal sets $O$, $W$, consider each token request sent to $O$
from $W$. We say it is an internal request if $W \in \mathcal{S}_1$ and $W$ and $O$ intersect with a set
$A$ chosen by the algorithm. Otherwise, we say it is an external request.

Round 2 - Redistribution for $S_1 \in \mathcal{S}_1$. $S_1$ sends $t$ requests of one token to $O_i$ if
$|O_i \cap A| = t$. For each node $V$ in $\mathcal{F}$, internal requests are considered prior to external
requests.

For each request sent to $V$ from $W$:

- If $V$ has at least 8 tokens, give one to $W$.

- If $V$ has only 7 tokens:

(1) If $V$ is a leaf, it requests from the node which has received 4 tokens from it
during the first round of redistribution.

(2) If $V$ is not a leaf:

(2.1) If $W \in \mathcal{S}_1$, $V$ requests a token from a leaf which has at least 8 tokens in the
subtree rooted at $V$.

(2.2) If $W$ is a node in $\mathcal{F}$, suppose $V$ has children $V_1, \ldots, V_d$ and $W$ belongs to
the subtree rooted at $V_1$. $V$ requests from a leaf which has at least 8 tokens in the
subtrees rooted at $V_2, \ldots, V_d$.

We now prove the correctness of the second round of redistribution. 

Proposition 6. Every singleton node $O$ of level $j$ has at most $j - 1$ requests. Every
leaf has at most $k - 2$ internal requests. Every internal node of level $j$ has at most $j$
requests. The root of level $s$ has at most $s$ requests.

Proposition 7. A singleton node $O$ can satisfy all the requests.

Proposition 8. The root $R$ of level $s$ and degree $r$ can satisfy all the requests if $s + r \ge 2$.

Proposition 9. There is no external request sent to a root which is of level 0 and degree 1.

Proposition 10. Any external request sent from a leaf $L$ can be satisfied.

Proposition 11. Any external request sent from an internal node of degree $d \ge 3$ can
be satisfied.

Proposition 12. Any external request sent from an internal node $V$ of degree 2 can be
satisfied.

The proofs of Propositions 6 to 12 are given in Appendix Section 7.1. From Propositions
7, 8, 10, 11 and 12, we know that all requests can be satisfied. Hence after the
second round of redistribution, every set in $\mathcal{S}_1$ has at least 7 tokens. And from Proposition 9,
we know that every root of level 0 and degree 1 retains 4 tokens.

We consider a root $R$ which is a 0-level set of degree 1. $R$ receives 4 tokens from leaf
$B$. Assume $B$ is covered by $\{A_i\} \subseteq \mathcal{A}$ and the sets $\{A_i\}$ intersect with $\{O_i\}_{i=1}^{l} \subseteq \mathcal{O}$.

We first prove the following.

Proposition 13. For every $O \in \{O_i\}_{i=1}^{l}$, $O$ does not receive any token request during Round 2
of redistribution.

The proof of Proposition 13 is given in Appendix Section 7.1. Based on Proposition
13, for any set $O \in \{O_i\}_{i=1}^{l}$, we can think of a token request from $R$ to $B$ as an internal
request to $O$. We describe the third round of redistribution.

Round 3 - Redistribution for a root of level 0 and degree 1. Request 1 token from
each of $O_1, \ldots, O_l$ following Round 2.

Proposition 14. $l \ge 3$.

The correctness of Round 3 follows from Proposition 14. The proof of Proposition
14 is given in Appendix Section 7.1. We thus prove that a root of level 0 and degree 1
has 7 tokens after the third round of redistribution.

Therefore, after the three rounds of token redistribution, each optimal set has at least
7 tokens. Hence the approximation ratio of the Restricted 4-Set Packing algorithm is at
least $\frac{7}{16}$. □

We give the construction of a tight example in Appendix Section 7.2. We thus
conclude that the approximation ratio of the Restricted 4-Set Packing algorithm is $\frac{7}{16}$.
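
The counting step behind this bound can be spelled out as follows. This is a sketch of the arithmetic implied by the token argument, writing $\mathcal{A}$ for the packing returned by the algorithm and $\mathcal{O}$ for the fixed optimal packing.

```latex
% Every set in \mathcal{A} covers 4 elements and every covered element
% carries 4 tokens, so the packing supplies 16 |\mathcal{A}| tokens in total.
% After redistribution, every optimal set holds at least 7 of them.
\[
  16\,|\mathcal{A}| \;\ge\; 7\,|\mathcal{O}|
  \quad\Longrightarrow\quad
  \frac{|\mathcal{A}|}{|\mathcal{O}|} \;\ge\; \frac{7}{16}.
\]
```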

3.3 Analysis of the Restricted k-Set Packing Algorithm, $k \ge 5$

For $k \ge 5$, we prove that the approximation ratio of the Restricted k-Set Packing
algorithm is the same as that of the set packing heuristic [7]. For fixed $s$, a local improvement can
replace at most $s$ sets with $s + 1$ sets. We prove that for any $\epsilon > 0$, there exists an $s$,
such that the approximation ratio of the Restricted k-Set Packing algorithm is at least
$\frac{2}{k} - \epsilon$.

The proof strategy is similar to that of the Restricted 4-Set Packing algorithm. We
first create a forest of blocking trees $\mathcal{F}$. We give every element covered by sets chosen
by the algorithm one unit of tokens. We then redistribute the tokens among all the
optimal sets. We claim that for sufficiently large $s$, after redistribution, every optimal set gets at
least $2 - k\epsilon$ units of tokens. We use a different parameter of local improvement from
Theorem 2. The algorithm still runs in polynomial time.

Proof. The first round of redistribution goes as follows.

Round 1 - Redistribution in each blocking tree $\mathcal{T}$. Every leaf in $\mathcal{T}$ has $k - 3$
units of free tokens to distribute. Every internal node $V$ of degree $d$ receives $d - 2$ units
of tokens from a leaf. The root receives the remaining tokens.

Proposition 15. After Round 1, every node has at least 2 units of tokens, except
singletons which are 1-level sets.

We consider the collection $\mathcal{S}_1$ of 1-level sets which are singletons in $\mathcal{F}$. Let $S_1$
be such a 1-level set that intersects with a k-set $A$ chosen by the algorithm. Assume $A$
intersects with $j$ other optimal sets $\{O_i\}_{i=1}^{j}$.

Round 2 - Redistribution for $S_1 \in \mathcal{S}_1$.

- For every singleton node of level $i$ ($i \ge 3$):
Send 1 unit of tokens each to $i - 2$ arbitrary internal requests.

- If $S_1$ receives one unit of tokens, we are done. Otherwise, pick an arbitrary singleton
node $O_i$ from $\{O_i\}_{i=1}^{j}$. Let $O_i = O_1$. Let $A_1 \in \mathcal{A}$ be a set intersecting with
$O_1$ while it does not intersect with any set in $\mathcal{S}_1$. (The existence of $A_1$ follows from
Proposition 6.)

- If $A_1$ intersects with some singleton node which has at least 3 units of tokens, we
move 1 unit to $S_1$. Otherwise pick an arbitrary singleton node $O_2$ which intersects
with $A_1$. Let $A_2 \in \mathcal{A}$ be a set intersecting with $O_2$ while it does not intersect with
any set in $\mathcal{S}_1$.

- Repeat this procedure and form a chain of singleton sets $O_1 = O_i, O_2, \ldots, O_p$ such
that $S_1$ and $O_1$ intersect with $A = A_0$, and $O_i$ and $O_{i+1}$ intersect with a set $A_i$ chosen
by the algorithm, for $i = 1, \ldots, p - 1$, until

(1) The chain ends when, excluding $O_p$, every set intersecting with $A_{p-1}$ is not a
singleton node. Denote the collection of such $S_1$ by $\mathcal{S}_T$.

Move 1 unit of tokens from any node in the collection of non-degenerate blocking
trees which has at least 3 units of tokens to $S_1$.

(2) The chain ends where there exists a 1-level set $S_1' \in \mathcal{S}_1$ which intersects with
$A'$, such that $S_1'$ starts another chain of length $q$ which ends at $O_p$ ($= O_q$) (as
illustrated in Fig. 4 (Appendix Section 8.1)).

Construct a graph $\mathcal{G}$, such that every vertex in the graph represents a node in a
chain, and $V_1$, $V_2$ are connected if the corresponding nodes intersect with a set $A$
chosen by the algorithm. Consider every connected component $\mathcal{C}$ in $\mathcal{G}$. Equally
distribute all tokens to every vertex in $\mathcal{C}$.

The correctness of step (1) and step (2) follows from Propositions 16 and 17. We
give the proofs in Appendix Section 8.2.

Proposition 16. The collection of non-degenerate blocking trees has at least $|\mathcal{S}_T|$ free
units of tokens. After step (1), every set in $\mathcal{S}_T$ has 2 units of tokens.

Proposition 17. After step (2), every optimal set has at least $2 - k\epsilon$ units of tokens.

We conclude that after the second round of redistribution, every optimal set gets at
least $2 - k\epsilon$ units of tokens. Therefore, the approximation ratio of the Restricted k-Set
Packing algorithm is at least $\frac{2}{k} - \epsilon$. □

Moreover, the tight example of the k-set packing heuristic [7] can also serve as a
tight example of the Restricted k-Set Packing algorithm for $k \ge 5$. Hence, the
approximation ratio of the Restricted k-Set Packing algorithm is $\frac{2}{k} - \epsilon$.
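
For reference, the step from the token bound to the ratio is the following short calculation, a sketch of the arithmetic with $\mathcal{A}$ denoting the packing returned by the algorithm and $\mathcal{O}$ the fixed optimal packing.

```latex
% Every set in \mathcal{A} covers k elements and every covered element
% carries one unit of tokens, so k |\mathcal{A}| units are available.
% After redistribution, every optimal set holds at least 2 - k\epsilon units.
\[
  k\,|\mathcal{A}| \;\ge\; (2 - k\epsilon)\,|\mathcal{O}|
  \quad\Longrightarrow\quad
  \frac{|\mathcal{A}|}{|\mathcal{O}|} \;\ge\; \frac{2}{k} - \epsilon.
\]
```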



4 Analysis of the Algorithm PRPSLI 

We use the factor-revealing linear program introduced by Jain et al. [8] to analyze the
approximation ratio of the algorithm PRPSLI. Athanassopoulos et al. [1] first applied this
method to the k-Set Cover problem. Notice that the cover produced by the restricted
set packing algorithms is a cover which minimizes the number of 1-sets. In Appendix
Section 9.1, we first show that for any $k \ge 4$, there exists a k-set cover which
simultaneously minimizes the size of the cover and the number of 1-sets in the cover. We then
present the set-ups and notations of the factor-revealing linear program (LP).

The proof of Theorem 3 is similar to the proof of Theorem 6 in [1]. Namely, we
find a feasible solution to the dual program of (LP), which makes the objective function
of the dual program equal to the value of $\rho_k$ defined in Theorem 3. Thus $\rho_k$ is an upper
bound of the approximation ratio of PRPSLI. We give the proof of this upper bound
in Appendix Section 9.2. On the other hand, we give
an instance for each $k$ in Appendix Section 9.3, such that PRPSLI does not achieve a
better ratio than $\rho_k$ on this instance. We thus prove Theorem 3.

Appendix

5 Comparison on the Approximation Ratio of the k-Set Cover
Problem with Previous Works

Table 1. Comparison on the Approximation Ratio of the k-Set Cover Problem

k          GRSLI_{k,5} [2]   Levin [10]      PRSLI_{k,5} [1]   PRPSLI
3          1.3333            1.3333          1.3333            1.3333
4          1.5833            1.5808          1.5833            1.5208
5          1.7833            1.7801          1.7833            1.7333
6          1.9500            1.9474          1.9208            1.8667
7          2.0929            2.0903          2.0690            2.0190
8          2.2179            2.2153          2.1762            2.1262
9          2.3290            2.3264          2.2917            2.2413
10         2.4290            2.4264          2.3802            2.3302
20         3.0977            3.0952          3.0305            2.9779
21         3.1454            3.1428          3.0784            3.0284
50         3.9992            3.9966          3.9187            3.8683
75         4.4014            4.3988          4.3178            4.2678
100        4.6874            4.6848          4.6021            4.5520
large k    H_k - 0.5         H_k - 0.5026    H_k - 0.5902      H_k - 0.6402


6 Section 3.1 



6.1 Example 1 

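Example 1 (Blocking). Consider an instance $(U, \mathcal{S})$ of the 4-Set Cover problem.
Suppose there is an optimal solution $\mathcal{O}$ of the Restricted 4-Set Packing problem which
consists of only disjoint 4-sets that cover all elements. Let $\mathcal{O} = \{O_i\}_{i=1}^{16} \cup \{B_i\}_{i=1}^{m}$,
$m \ge 12$. Let $\{A_i\}_{i=1}^{7}$ be a collection of 4-sets chosen by the algorithm. $\{O_i\}_{i=1}^{16}$ is
a collection of 1-level or 2-level sets. Denote the j-th element of $O_i$ by $o_i^j$, for $j =
1, 2, 3, 4$. If $A_i = (o_{i_1}^{j_1}, o_{i_2}^{j_2}, o_{i_3}^{j_3}, o_{i_4}^{j_4})$, we say that $A_i$ covers the elements
$o_{i_1}^{j_1}, o_{i_2}^{j_2}, o_{i_3}^{j_3}$ and $o_{i_4}^{j_4}$. Denote the following unit by U,

$$
U = \left\{
\begin{array}{l}
A_1 = (o_1^1, o_5^1, o_9^1, o_{13}^1) \\
A_2 = (o_2^1, o_6^1, o_{10}^1, o_{14}^1) \\
A_3 = (o_3^1, o_7^1, o_{11}^1, o_{15}^1) \\
A_4 = (o_4^1, o_8^1, o_{12}^1, o_{16}^1) \\
A_5 = (o_1^2, o_2^2, o_3^2, o_4^2) \\
A_6 = (o_5^2, o_6^2, o_7^2, o_8^2) \\
A_7 = (o_9^2, o_{10}^2, o_{11}^2, o_{12}^2)
\end{array}
\right.
$$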

We visualize this construction in Fig. 1. Let each cube represent a 4-set in $\{A_i\}_{i=1}^{7}$,
and we place $\{O_i\}_{i=1}^{16}$ vertically within a $4 \times 4$ square (not shown in the figure), such
that each $O_i$ intersects with one or two sets in $\{A_i\}_{i=1}^{7}$. The sets $\{A_i\}_{i=1}^{7}$ are placed horizontally.
Notice that $O_{13}, O_{14}, O_{15}, O_{16}$, which intersect with $A_1, A_2, A_3, A_4$ respectively, are
1-level sets. The other 12 sets in $\{O_i\}_{i=1}^{16}$ are 2-level sets.

$\{B_i\}_{i=1}^{m}$ is a collection of 3-level sets. Notice that for our fixed optimal solution, all
elements can be covered by 4-sets, so there is no 1-set needed to finish the cover. For
given $\mathcal{S}$, when we compute an extension of the packing to a full cover via the semi-local
(2,1)-improvement, assume the unpacked element of $B_i$ ($1 \le i \le 12$) can only be
covered by a 2-set intersecting with both $B_i$ and $O_i$, or it introduces a 1-set in the cover.
The remaining unpacked elements of $\{O_i\}_{i=1}^{16}$ and $\{B_i\}_{i=1}^{m}$ can be covered arbitrarily
by 2-sets and 3-sets. In unrestricted packing, one of the local improvements consists
of replacing $A_1, A_2, A_3, A_4, A_5$ by $O_1, O_2, O_3, O_4, O_{13}, O_{14}, O_{15}, O_{16}$. However, in
restricted packing, for $1 \le i \le 12$, adding any $O_i$ to the packing would create a 1-set
covering the unpacked element of $B_i$ during the semi-local (2,1)-improvement. Hence
this local improvement is prohibited as a result of the restriction on the number of 1-sets.

We remark that blocking can be much more complicated than in this simple example.
As we shall see later in Section 3.2, for the Restricted 4-Set Packing problem, a
3-level set can initiate a blocking of many optimal sets.




Fig. 1. Placement of $A_1$ to $A_7$



6.2 Example 2 

Example 2 (Finish the cover by 1-sets, 2-sets and 3-sets). In Fig. 2, Fig. 3 and Fig. 4,
rectangles placed vertically represent optimal sets. Circles represent 1-sets. Ellipses
placed horizontally represent 2-sets or 3-sets, where the smaller ones stand for 2-sets
and the larger ones stand for 3-sets. The cross symbol represents an element covered by
a k-set in $\mathcal{A}$. Let $(n_1, n_2, n_3)$ be an ordered triple, such that $n_1$ is the number of 1-sets,
$n_2$ is the total number of 2-sets and 3-sets which are not within one optimal set, and $n_3$
is the number of 3-sets which are not within one optimal set. These 1-sets, 2-sets and
3-sets are used to finish the cover. The right picture is always a cover which is before
the cover in the left picture in the lexicographic order.

Fig. 2. $(2,1,0) \Rightarrow (2,0,0)$

Fig. 4. $(0,2,1) \Rightarrow (0,2,0)$

6.3 Proofs in Section 3.1

Proof (Proposition 1). It is sufficient to show that there is no cycle in $\mathcal{F}$. Suppose
$V_1, \ldots, V_l$ form a cycle in $\mathcal{F}$. We remove the 2-sets intersecting with adjacent nodes
in the cycle and add a 2-set inside each $V_i$, for $i = 1, \ldots, l$. Then the total number of
2-sets and 3-sets not within an optimal set decreases, contradicting the minimality of
the chosen extension. □



Proof (Proposition 2). Let $V_1$ be an $i$-level set which is covered by a 1-set, for $i < k-1$.
If more than one 1-set covers $V_1$, we can replace them with 2-sets or 3-sets
inside $V_1$. Hence we only consider the case that exactly one 1-set covers $V_1$.
If there is a 2-set or 3-set inside $V_1$, we can remove the 1-set and replace the 2-set or
3-set with a 3-set or two 2-sets respectively. If $V_1$ is a singleton node, since $i < k-1$,
there exists a 2-set or 3-set inside $V_1$. Otherwise let $V_l$ be any node of degree 1 which
connects to $V_1$ via a simple path $V_1, V_2, \ldots, V_{l-1}, V_l$. We remove all 2-sets intersecting
adjacent nodes on this path and move the 1-set from $V_1$ to $V_l$, then add 2-sets inside
$V_1, \ldots, V_{l-1}$. If there is a 1-set in any $V_i$, $2 \le i \le l-1$, it can be replaced, together
with the 2-set just added inside $V_i$, by a 3-set. If there is another 1-set in $V_l$, it can
be combined with the 1-set moved from $V_1$ into a 2-set. Hence, there is no 1-set in any
$i$-level set for $i < k-1$.

However, a 1-set can remain in a $(k-1)$-level set. In this case, the $(k-1)$-level set is a
singleton node in $\mathcal{F}$. □



Proof (Proposition 3). We consider a non-degenerate tree. Suppose there are two nodes,
an $i$-level set $V_1$ and a $j$-level set $V_l$, which satisfy the requirements. Consider a simple
path connecting $V_1$ and $V_l$. We remove all 2-sets intersecting adjacent nodes on
this simple path. For the nodes on the path other than $V_1$ and $V_l$, we can add a 2-set inside
each optimal set. For $V_1$, since $V_1$ is not a $(k-1)$-level set, there is no 1-set inside $V_1$.
Moreover, the degree of $V_1$ is smaller than $k-i$, hence there is a 2-set or 3-set inside
$V_1$, which we can replace with a 3-set or two 2-sets inside $V_1$ respectively. The same
argument applies to $V_l$. Hence, there can be at most one $i$-level set with degree smaller
than $k-i$. □



Proof (Proposition 4). We first prove that there is at most one node of either 0-level or
1-level which is of degree 2. Assume $V_1$ and $V_l$ are 0-level or 1-level sets of degree
2. We replace the 2-sets intersecting two adjacent nodes along the simple path
connecting $V_1$ and $V_l$ with 2-sets or 3-sets inside the optimal sets represented by these
nodes; then the total number of 2-sets and 3-sets not within an optimal set decreases. In
this way, $V_1$ and $V_l$ become nodes of degree 1.

Suppose we have one 0-level or 1-level set $V_1$ of degree 2. We prove that there is
no node of degree 1 representing an $i$-level set for any $i < k-2$, thus $V_1$ is the root.
Assume $W$ is an $i$-level set of degree 1. We replace the 2-sets intersecting two
adjacent nodes along the simple path connecting $V_1$ and $W$ with 2-sets or 3-sets inside
the optimal sets represented by these nodes; then the total number of 2-sets and 3-sets
not within an optimal set decreases. In this way, $V_1$ becomes a node of degree 1 and $W$
becomes a node of degree 0. □
7 Section 3.2

7.1 Proofs in Section 3.2 

Proof (Proposition 5). Consider any subtree with root $V$ of degree $d+1$, $d \ge 2$ ($V$ is
not the root of $\mathcal{T}$). Since any internal node of degree 2 does not request any token, for
simplicity we assume there is no internal node of degree 2 in the subtree. Assume $V$
has children $V_1, \ldots, V_d$. If every child of $V$ is a leaf, since the requests are processed in the
reverse order given by BFS, we know that there is no other node requesting tokens from
$V_1, \ldots, V_d$, hence $V$'s requests can be satisfied. Assume that in every subtree rooted at
$V_1, \ldots, V_d$, all the requests have been satisfied. Since, for any tree with root of degree $r$,
the quantity

\[ \sum_{v \text{ is an internal node}} (d_v - 2) + r \tag{7.1} \]

equals the number of leaves of the tree, we know that there are 4 free tokens in
each subtree rooted at $V_1, \ldots, V_d$. Hence, $V$'s requests can be satisfied.

By (7.1), the root of $\mathcal{T}$ receives $4r$ tokens. □
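The leaf-counting identity (7.1) used in this proof can be checked on a small rooted tree. The sketch below is only an illustration: the tree, the `children` dictionary encoding and the function name `leaf_identity` are assumptions made for the example, not part of the paper. An internal node's degree counts its children plus the edge to its parent, while the root's degree $r$ counts only its children.

```python
def leaf_identity(children, root):
    """Return (lhs, leaves), where lhs = sum over internal nodes of (d_v - 2) + r."""
    r = len(children.get(root, []))        # degree of the root
    lhs, leaves = r, 0
    stack = list(children.get(root, []))
    while stack:
        v = stack.pop()
        kids = children.get(v, [])
        if kids:                           # internal node: children plus the parent edge
            lhs += (len(kids) + 1) - 2
            stack.extend(kids)
        else:                              # leaf
            leaves += 1
    return lhs, leaves


# Hypothetical example tree: root R with two internal children a and b.
tree = {"R": ["a", "b"], "a": ["c", "d", "e"], "b": ["f", "g"]}
print(leaf_identity(tree, "R"))
```

Both components of the returned pair agree on this tree, as the identity predicts.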

Proof (Proposition 6). Suppose a singleton node $O$ is covered by $\{A_i\}_{i=1}^{l}$. If every $A_i$
intersects some $S_i \in \mathcal{S}_1$, we have a local improvement by replacing $A_1, \ldots, A_l$ with
$S_1, \ldots, S_l$ and $O$. Hence, $\{A_i\}_{i=1}^{l}$ intersect at most $l - 1 \le j - 1$ sets in $\mathcal{S}_1$, and $O$ has
at most $j - 1$ requests of tokens. Similarly, since a leaf is a $(k-1)$-level set, it
has at most $k - 2$ internal requests.

Every internal node of level $j$ can have $j$ requests, and the root of level $s$ can have
$s$ requests. □

Proof (Proposition 7). Assume $O$ is of level $j$ and $j \ge 2$. From Proposition 6, $O$ has at
most $j - 1$ requests. After giving away $j - 1$ tokens, $O$ has $3j + 1 \ge 7$ tokens left. Hence, $O$
can satisfy all the requests. □

Proof (Proposition 8). The root has $4(s + r)$ tokens and it has at most $s + r$ requests.
If it has at most $s + r - 1$ requests, it retains $3s + 3r + 1 \ge 7$ tokens after satisfying all
the requests.

It remains to prove that it cannot have $s + r$ requests. Otherwise, suppose $R$ is
covered by $\{A_i\}_{i=1}^{l}$ and every $A_i$ intersects some $S_i \in \mathcal{S}_1$. There are leaves $L_1, \ldots, L_r$
which request tokens from $R$ for 1-level singletons $S'_1, \ldots, S'_r$, where $S'_i$ intersects $A'_i \in \mathcal{A}$.
Then we have an $(l + r)$-$(l + r + 1)$-improvement (replace $A_1, \ldots, A_l, A'_1, \ldots, A'_r$ with
$S_1, \ldots, S_l, S'_1, \ldots, S'_r$ and $R$). □

Proof (Proposition 9). If such a root has an external request from $V$, then $V$ is a leaf
and $V$ has an internal request from an $S \in \mathcal{S}_1$, where $S$ intersects an $A \in \mathcal{A}$. Then we
have a 1-2-improvement (replace $A$ with $S$ and the root). □

Proof (Proposition 10). By Proposition 6, $L$ makes an external request if and only if it
has 2 internal requests. As in Step (1), $L$ sends a request to $V$, which has received 4 tokens
from it. If $V$ has at least 8 tokens, it gives one to $L$. Otherwise, we proceed with Step
(2.2). Hence, it is sufficient to prove that in (2.2), there exists a leaf in some subtree
rooted at $V_2, \ldots, V_d$ which has 8 tokens.

Otherwise, suppose every leaf in the subtrees rooted at $V_2, \ldots, V_d$ has an internal
request. We pick $L_2, \ldots, L_d$ belonging to the subtrees rooted at $V_2, \ldots, V_d$ respectively. $L_i$
has an internal request from $S_i$, which intersects $A_i \in \mathcal{A}$. Moreover, we know
that $L$ has an internal request from $S$ intersecting $A \in \mathcal{A}$. We have a $d$-$(d+1)$-improvement
(replace $A_2, \ldots, A_d, A$ with $S_2, \ldots, S_d, S$ and $V$). □

Proof (Proposition 11). An internal node of degree at least 3 sends a request if and only 
if it receives a request from a leaf but fails to satisfy it. The proof is indeed contained in 
the proof of Proposition 10. □ 

Proof (Proposition 12). If $V$ makes an external request, by Proposition 6, $V$ has two
internal requests, say from $S_1, S_2$, where $S_1$ intersects $A_1 \in \mathcal{A}$ and $S_2$ intersects
$A_2 \in \mathcal{A}$.

Let $X$ be a closest node to $V$ which belongs to the subtree rooted at $V$ and has
degree $d \ge 3$. If no such $X$ exists, let $X = V$ and $d = 2$. If, on the contrary, $V$'s
request cannot be satisfied, we pick a leaf in each subtree rooted at a child of $X$. Let
$L_1, \ldots, L_{d-1}$ be these leaves; then $L_i$ has an internal request from $S'_i$ intersecting
$A'_i \in \mathcal{A}$. We then have a local improvement by replacing $A'_1, \ldots, A'_{d-1}, A_1, A_2$ with
$S'_1, \ldots, S'_{d-1}, S_1, S_2$ and $V$. □

Proof (Proposition 13). First, VO ^ S?\. Otherwise, assume O intersects with A e 
we have a 1-2-improvement (replace A with O and R). Hence, O does not have any 
internal request. 

$O$ does not have any external request either. Recall that in Round 2 of redistribution, a
leaf has an external request if there exists an internal node $V$ of degree $d$, such that $O$
belongs to the subtree rooted at $V$, and

- If $d = 2$, $V$ has two internal requests from $S_1, S_2$, where $S_1, S_2$ intersect $A_1, A_2 \in \mathcal{A}$
respectively. However, in this case we have a 3-4-improvement (replace $A, A_1, A_2$ with
$R, S_1, S_2, V$).

- If $d = 3$, there is an external request sent to $V$ from a leaf $L_1$, where $L_1$ has an internal request
from $S_1$ which intersects $A_1 \in \mathcal{A}$, and there is an internal request sent to $V$ from
$S_2$. Let $S_2$ intersect $A_2 \in \mathcal{A}$. We have a 3-4-improvement (replace $A, A_1, A_2$ with
$R, S_1, S_2, V$).

- If $d = 4$, there are two external requests sent to $V$ from leaves $L_1, L_2$. $L_1, L_2$ have
internal requests from $S_1, S_2$, and $S_1, S_2$ intersect $A_1, A_2 \in \mathcal{A}$ respectively. There is
a 3-4-improvement (replace $A, A_1, A_2$ with $R, S_1, S_2, V$).

Hence, $O$ does not have any token request during Round 2 of redistribution. □

Proof (Proposition 14). First, $j \ge 2$. Otherwise, we have a 1-2-improvement (replace
$A_1$ with $B$ and $R$).

If $j = 2$, then the number of elements contained in $\{O_i\}_{i=1}^{l} \cap \{A_i\}_{i=1}^{j}$ is $2 \cdot 4 - 3 = 5$.
Hence $l \ge 2$. If $l = 2$, there exists an $O_i$ which is a 3-level set and completely covered
by $\{A_i\}_{i=1}^{j}$. However, in this case we have a 2-3-improvement (replace $A_1, A_2$ with
$O_i, B, R$). Hence, $l \ge 3$.

If $j = 3$, then the number of elements contained in $\{O_i\}_{i=1}^{l} \cap \{A_i\}_{i=1}^{j}$ is $3 \cdot 4 - 3 = 9$.
In this case, $l \ge 3$. □
7.2 Tight example of the Restricted 4-Set Packing algorithm

We construct an example showing that for any fixed $s$, which is the parameter of the
local improvement, and any $\epsilon > 0$, there exists an instance on which the Restricted
4-Set Packing algorithm has a performance ratio at most $\frac{7}{16} + \epsilon$. We thereby conclude
that the approximation ratio of the Restricted 4-Set Packing algorithm is $\frac{7}{16}$.

Our construction is randomized. We take copies of the unit U defined in Example 
1. Then there are $n$ 2-level sets. Suppose there are $m$ 3-level sets which start the blocking.
In the case of 4-set packing, blocking can start from a single 3-level set $S_0$, then propagate
through an arbitrary number of 2-level sets $S_1, S_2, \ldots, S_i$ ($i \ge 1$). More specifically,
we form the blocking by covering the remaining uncovered elements of $S_0, S_1, \ldots, S_i$
by 2-sets $T_1, T_2, \ldots, T_i$, such that $T_j$ intersects $S_{j-1}$ and $S_j$, for $j = 1, 2, \ldots, i$.

In our example, we assign each 2-level set to one of the m blocking sets inde- 
pendently and uniformly at random. If a 2-level set is assigned to a blocking set, that 
blocking set is a leaf of the blocking tree containing the 2-level set. For fixed s, we 
consider local p-(p + 1) -improvements for any p < s. For each subset of srf with size 
p, suppose after removing one covering set of A blocking sets, we can replace these 
p sets with at most q optimal sets. Then q < =|2 < If q — p — A > 1, i.e., 
A<q — p — 1<^<2s, there is a local improvement. 

We assume n 3> s. Let t be the maximum number of 2-level sets which can be added 
to the solution by the local improvement. Then t < il 2 . Let &i be the collection of 2- 

level sets. Let {Ei, £2, £n} be a set of random variables, where N = Yli=i (") ~ n * 
is the number of nonempty subsets of G-i with size at most t. Assume we enumerate 
every 2-level sets and arrange all subsets of 62 lexicographically. Let £j be the event 
that there exists a local improvement which adds the i-th subset with size at most t of 
62 to the solution. Let Y be the number of blocking sets assigned to the 2-level sets in 
this subset. Then, 

Pr(&) < Pr(F < 2s) = Pr <X = A) • (7.2) 

Since the assignments of 2-level sets to blocking sets are independently and uni- 
formly at random, we bound (7.2) as follows, 



< (2s) N ■ m-^" 28 ) w {-) N . (7.3) 
m 

Assume that m s. Since each depends on less than N ps n* elements in 
{£i}?Li, and N ■ Pr(£ 4 ) = o(l). By the following lemma, we have Pr(f|^i ~£~i) > 0. 

Lemma 1 (Corollary 5.12 [11]). Let $\{\mathcal{E}_1, \mathcal{E}_2, \ldots, \mathcal{E}_n\}$ be events in a probability space
with $\Pr(\mathcal{E}_i) \le p$ for all $i$. If each event is mutually independent of all other events
except for at most $d$, and $ep(d + 1) \le 1$, then $\Pr(\bigcap_{i=1}^{n} \bar{\mathcal{E}}_i) > 0$.
Therefore, there exists an assignment of 2-level sets to blocking sets such that no
local p-(p + 1) -improvement is possible, for p < s. 



Assume all 3-level sets but a constant few of them are blocking sets. The perfor- 
mance ratio of the Restricted 4-Set Packing algorithm on this instance is 7 " +9r " — 
+ 0(~). The ratio tends arbitrarily close to when n — » oo. 



16n+12m 



8 Section 3.3 
8.1 Figure 5 



Fig. 5. Rectangles represent optimal $k$-sets. Ellipses represent $k$-sets chosen by the
algorithm.



8.2 Proofs in Section 3.3 

Proof (Proposition 15). According to Proposition 4 and (7.1), we know that after Round
1, every node has at least 2 units of tokens except singletons which are 1-level sets.
Note that, contrary to the Restricted 4-Set Packing problem, in restricted $k$-set packing
for $k \ge 5$, every leaf contributes at least 2 units of tokens. Hence, even if a root is of
level 0 and degree 1, it still receives at least 2 units of tokens after the first round of
redistribution.

We remark that we do not care from which leaf an internal node or the root
receives its tokens. □

Proof (Proposition 16). Let $N_1 = |\mathcal{S}_1|$. Without loss of generality, assume there is
only one non-degenerate tree $\mathcal{T}$. Let $\mathcal{T}$ have an $s$-level root of degree $r$ and a set of
internal nodes with degrees $\{d_v\}_{v \text{ is an internal node}}$.

From (7.1), we know that the leaves contribute $(k-3)\sum(d_v - 2) + (k-3)r$
units of tokens. Here the summation goes through the set of internal nodes. There are
$\sum(d_v - 2 + k - d_v - 2) + (k-4)\sum(d_v - 2) + (k-3)r + s - 2$, i.e.
$(k-4)\sum(d_v - 1) + (k-3)r + s - 2$, free units of tokens in $\mathcal{T}$ which can be distributed to sets in $\mathcal{S}_1$.
Assume on the contrary that

\[ N_1 > (k-4)\sum(d_v - 1) + (k-3)r + s - 2. \tag{8.1} \]

We derive an upper bound on $N_1$.

We first claim that $|A_i \cap O_j| \le 1$ for any $i, j$. If $|A_i \cap O_j| \ge 2$, we know that $O_j$ has at
least 3 units of tokens.

If Ap^i — A, the k — 1 elements of A which do not intersect with S\, intersect with 
ST . If A p -i / A, for each S%, there are k—1 corresponding elements intersecting with 
ST , Here, these k — 1 elements belong to A v -\. Hence, & covers at least (k — l)Ni 
elements. On the other hand, there are (k — l)[£2(dv — 2) + r] + — dy) + s 
elements covered by 3" . We have, 

(k - l)JVi < (k - l)[^2(d v - 2) + r] + J2( k ~dv) + s. (8.2) 

Combining (ED and we get (k - l)[(k - 4) J2( d v - 1) + (k - 3)r + s - 2] < 
(k — — 2) + r] + ~ d v) + s , which implies 

J2( k2 - 6fc + 6)(d v - 1) + (k 2 ~5k + 5)r + (k- 2)s - 2(k - 1) < . (8.3) 

(18.3b only holds for the case that k = 5, r = 1, s = and there are at most 2 internal 
nodes. In this case, we observe that the leaf cannot intersect with any set A which 
intersects with S G or we have a 1-2-improvementby replacing A with S and the 
root. Hence, we modify (18.21 i and get 



Combining ( 18.4b and ( 18. U . we have ^(Sdy — 9) < 0. Since dy > 2, it leads to a 



Proof (Proposition 17). Let e be the number of edges and v be the number of vertices 
in^. 

If ^ contains a circle, then e > v. The average number of tokens in ^ is — > 2. 

If ^ is a tree, then e = u — 1, We claim that e > s + 1. Otherwise, there exists 
a e-v-improvement (replace all the sets corresponding to edges in ^ with all the sets 
corresponding to vertices in c /o). The average number of tokens in^is— > 2 — - > 
2- > 2-kefors> 

Hence, in any case, after collecting all tokens of ^ then equally distributed among 
every vertex in % ', each vertex gets at least 2 — ke units of tokens. □ 

9 Proof of Theorem 3 

9.1 Set-up of the Factor-revealing LP 

We first prove that there exists a $k$-set cover which simultaneously minimizes the size and
the number of 1-sets in the cover.



Ni > (k - 4) J2(d v - 1) + (k - 3)r + s - 2 . 



(8.1) 




(8.4) 



contradiction. 



□ 
Lemma 2. Suppose there are $k$-set covers $\mathcal{C}$ and $\mathcal{C}'$, where $\mathcal{C}$ has $b$ sets and $b_1$ 1-sets, and
$\mathcal{C}'$ has $b'$ sets and $b'_1$ 1-sets. Then there exists a $k$-set cover $\mathcal{C}''$ which has $\min(b, b')$
sets and $\min(b_1, b'_1)$ 1-sets.

Proof. We create a bipartite graph $G = (V, V', E)$, where every vertex in $V$ ($V'$) represents
a set in $\mathcal{C}$ ($\mathcal{C}'$), and every edge in $E$ represents an element in the universe. Hence,
the degree of a vertex represents the size of the corresponding set. Two vertices being
adjacent means the corresponding two sets cover a common element. We show how to find
$\mathcal{C}''$.

For simplicity, assume $G$ is connected and $b \le b'$. If $b_1 \le b'_1$, take $\mathcal{C}''$ to be $\mathcal{C}$.
Otherwise, consider a vertex $v_1 \in V$ of degree 1 such that its neighbor has degree at
least 2. If there exists a vertex $v_l \in V$ of degree at least 3, assume $v_l$ is the one with the
shortest distance to $v_1$. Consider the path $v_1, v'_1, \ldots, v'_{l-1}, v_l$ connecting $v_1$ and $v_l$. We
replace the corresponding sets of $v_1, \ldots, v_{l-1}$ with $v'_1, \ldots, v'_{l-1}$ and delete the element in
$v_l$ which is covered by both $v_l$ and $v'_{l-1}$. If any $v'_i$ ($1 \le i \le l - 1$) has degree at least 3,
delete the elements in $v'_i$ which are not covered by $v_i$ or $v_{i+1}$. In this way, $b_1$ decreases
by 1 while $b$ remains the same. If there is no $v_l \in V$ which has degree at least 3, since
$b_1 > b'_1$, and $\mathcal{C}$ and $\mathcal{C}'$ cover the same universe of elements, $b$ must be greater than $b'$,
a contradiction. Hence, we can eventually decrease $b_1$ to be at most $b'_1$. Finally, we take
$\mathcal{C}''$ to be this modified $\mathcal{C}$. □
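The bipartite graph used in this proof can be built directly from two covers. The following sketch assumes, as the paper does after closing the collection under subsets, that each cover is a partition of the same universe; the function name `cover_graph` and the two example covers are hypothetical. Each element contributes one edge between the set containing it in the first cover and the set containing it in the second, so vertex degrees equal set sizes.

```python
from collections import defaultdict


def cover_graph(cover_a, cover_b):
    """Return (edges, deg_a, deg_b); an edge (i, j, x) means element x lies in
    set i of cover_a and in set j of cover_b."""
    owner_b = {x: j for j, s in enumerate(cover_b) for x in s}
    edges = [(i, owner_b[x], x) for i, s in enumerate(cover_a) for x in sorted(s)]
    deg_a, deg_b = defaultdict(int), defaultdict(int)
    for i, j, _ in edges:
        deg_a[i] += 1
        deg_b[j] += 1
    return edges, dict(deg_a), dict(deg_b)


# Two hypothetical disjoint covers of the universe {1, ..., 6}.
C = [{1, 2, 3}, {4}, {5, 6}]
C_prime = [{1, 4}, {2, 5}, {3, 6}]
edges, deg_a, deg_b = cover_graph(C, C_prime)
print(deg_a, deg_b)
```

In this example the vertex for the 1-set `{4}` has degree 1 and its neighbor has degree 2, which is the starting configuration of the exchange argument.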

We are now ready to set up the factor-revealing linear program for the $k$-set cover
problem.

Let $(U, \mathcal{S})$ be an instance of the $k$-Set Cover problem, where $U$ is the set of elements
to be covered, $\mathcal{S}$ is a collection of sets, and $\bigcup_{S \in \mathcal{S}} S = U$. For $i = k, k-1, \ldots, 3$,
let $(U_i, \mathcal{S}_i)$ be the instance for Phase $i$ of Algorithm PRPSLI, where $U_i$ is the set of
elements which have not been covered before Phase $i$ and $\mathcal{S}_i$ is the collection of sets in
$\mathcal{S}$ which contain only elements in $U_i$. Let $OPT_i$ be an optimal solution of $(U_i, \mathcal{S}_i)$
for $i \ge 7$. For $i \le 6$, $OPT_i$ is an optimal solution of $(U_i, \mathcal{S}_i)$ with minimal number
of 1-sets. $OPT$ is an optimal solution of $(U, \mathcal{S})$. Let $b_{i,j}$ be the ratio of the number of
$j$-sets in $OPT_i$ over the number of sets in $OPT$. Let $\varrho_i$ be the approximation ratio of
the set packing algorithm used in Phase $i$. Let $a_1$ be the ratio of the number of 1-sets
chosen by the semi-local optimization phase over the number of sets in $OPT$. Since
$|OPT_i| \le |OPT|$, we have for $i = k, k-1, \ldots, 3$,

\[ \sum_{j=1}^{i} b_{i,j} \le 1. \tag{9.1} \]

In each phase of PRPSLI, the number of $i$-sets chosen by the algorithm is $n_i =
\frac{|U_i \setminus U_{i-1}|}{i}$. Since $U_{i-1} \subseteq U_i$, we have $|U_i \setminus U_{i-1}| = |U_i| - |U_{i-1}| =
(\sum_{j=1}^{i} j b_{i,j} - \sum_{j=1}^{i-1} j b_{i-1,j})|OPT|$. Recall that $\varrho_i$ is the approximation ratio of the
set packing algorithm used in Phase $i$. At the beginning of Phase $i$, there are $b_{i,i}|OPT|$ $i$-sets. Thus,

\[ n_i = \frac{\sum_{j=1}^{i} j b_{i,j} - \sum_{j=1}^{i-1} j b_{i-1,j}}{i}\,|OPT| \tag{9.2} \]
\[ \ge \varrho_i b_{i,i} |OPT|, \tag{9.3} \]

i.e.

\[ \sum_{j=1}^{i-1} j b_{i-1,j} - \sum_{j=1}^{i-1} j b_{i,j} - i(1 - \varrho_i) b_{i,i} \le 0. \tag{9.4} \]

We consider additional constraints imposed by the restricted phases, namely
Phase 6 to Phase 3. Let $a_j$ be the ratio of the number of $j$-sets chosen by the semi-local
optimization phase over the number of sets in $OPT$, for $j = 1, 2, 3$. In each restricted
phase, the number of 1-sets does not increase. Hence, for $i = 3, 4, 5, 6$,

\[ a_1 \le b_{i,1}. \tag{9.5} \]

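Next, we obtain an upper bound of the approximation ratio of PRPSLI. From Lemma
2.3 in [3], we have $a_1 + a_2 \le b_{3,1} + b_{3,2} + b_{3,3}$. Also notice that $a_1 + 2a_2 + 3a_3 =
b_{3,1} + 2b_{3,2} + 3b_{3,3}$. Thus we have an upper bound of $n_3$, namely,

\[ n_3 = (a_1 + a_2 + a_3)|OPT| = \Big(\frac{a_1}{3} + \frac{a_1 + a_2}{3} + \frac{a_1 + 2a_2 + 3a_3}{3}\Big)|OPT| \]
\[ \le \Big(\frac{a_1}{3} + \frac{b_{3,1} + b_{3,2} + b_{3,3}}{3} + \frac{b_{3,1} + 2b_{3,2} + 3b_{3,3}}{3}\Big)|OPT| \]
\[ = \Big(\frac{1}{3}a_1 + \frac{2}{3}b_{3,1} + b_{3,2} + \frac{4}{3}b_{3,3}\Big)|OPT|. \tag{9.6} \]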



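Combining (9.2) and (9.6), we have an upper bound of the approximation ratio of
PRPSLI as,

\[ \frac{\sum_{i=3}^{k} n_i}{|OPT|} \le \sum_{i=4}^{k} \frac{1}{i}\Big(\sum_{j=1}^{i} j b_{i,j} - \sum_{j=1}^{i-1} j b_{i-1,j}\Big) + \frac{1}{3}a_1 + \frac{2}{3}b_{3,1} + b_{3,2} + \frac{4}{3}b_{3,3} \]
\[ = \sum_{j=1}^{k} \frac{j}{k} b_{k,j} + \sum_{i=4}^{k-1} \sum_{j=1}^{i} \frac{j}{i(i+1)} b_{i,j} + \frac{1}{3}a_1 + \frac{5}{12}b_{3,1} + \frac{1}{2}b_{3,2} + \frac{7}{12}b_{3,3}. \tag{9.7} \]

Moreover,

\[ b_{i,j} \ge 0 \quad \text{for } j = 1, \ldots, i;\ i = k, \ldots, 3. \tag{9.8} \]
\[ a_1 \ge 0. \tag{9.9} \]

Hence, we define the factor-revealing linear program of PRPSLI with objective
function (9.7) and constraints (9.1), (9.4), (9.5), (9.8), (9.9) as follows,

\[
\begin{array}{lll}
\max & \displaystyle \sum_{j=1}^{k} \frac{j}{k} b_{k,j} + \sum_{i=4}^{k-1} \sum_{j=1}^{i} \frac{j}{i(i+1)} b_{i,j} + \frac{1}{3}a_1 + \frac{5}{12}b_{3,1} + \frac{1}{2}b_{3,2} + \frac{7}{12}b_{3,3} & \\
\text{s.t.} & \displaystyle \sum_{j=1}^{i} b_{i,j} \le 1, & i = 3, \ldots, k, \\
 & \displaystyle \sum_{j=1}^{i-1} j b_{i-1,j} - \sum_{j=1}^{i-1} j b_{i,j} - i(1 - \varrho_i) b_{i,i} \le 0, & i = 4, \ldots, k, \\
 & a_1 - b_{i,1} \le 0, & i = 3, \ldots, 6, \\
 & b_{i,j} \ge 0, & i = 3, \ldots, k,\ j = 1, \ldots, i, \\
 & a_1 \ge 0. &
\end{array} \tag{LP}
\]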
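The passage from the phase-by-phase bound to the objective (9.7) is a summation by parts, which can be sanity-checked numerically. The sketch below is an illustration only: the function names and the random test values are assumptions, and the coefficients are taken from the reading of (9.6) and (9.7) as $\frac{1}{3}a_1 + \frac{2}{3}b_{3,1} + b_{3,2} + \frac{4}{3}b_{3,3}$ and $\frac{1}{3}a_1 + \frac{5}{12}b_{3,1} + \frac{1}{2}b_{3,2} + \frac{7}{12}b_{3,3}$. It checks only the algebraic identity, not feasibility of the values.

```python
from fractions import Fraction as F
import random


def phase_sum(b, a1, k):
    """Sum over phases i = 4..k of (X_i - X_{i-1}) / i plus the (9.6) bound,
    where X_i = sum_j j * b[i][j]."""
    X = {i: sum(j * b[i][j] for j in range(1, i + 1)) for i in range(3, k + 1)}
    total = sum((X[i] - X[i - 1]) / i for i in range(4, k + 1))
    return total + a1 / 3 + F(2, 3) * b[3][1] + b[3][2] + F(4, 3) * b[3][3]


def objective(b, a1, k):
    """The objective function (9.7)."""
    val = sum(F(j, k) * b[k][j] for j in range(1, k + 1))
    val += sum(F(j, i * (i + 1)) * b[i][j]
               for i in range(4, k) for j in range(1, i + 1))
    return val + a1 / 3 + F(5, 12) * b[3][1] + F(1, 2) * b[3][2] + F(7, 12) * b[3][3]


# Arbitrary rational test values (not required to be LP-feasible).
rng = random.Random(0)
k = 7
b = {i: {j: F(rng.randint(0, 9), 10) for j in range(1, i + 1)} for i in range(3, k + 1)}
a1 = F(3, 10)
print(phase_sum(b, a1, k) == objective(b, a1, k))
```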
We also prove that 

Lemma 3. For any $k \ge 4$, the approximation ratio of Algorithm PRPSLI is upper-bounded
by the maximized objective function value of the factor-revealing linear
program (LP).

9.2 Finding an Upper bound for the approximation ratio of PRPSLI 

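Proof. Plug $\varrho_i = \frac{2}{i} - \epsilon$ for $i = k, \ldots, 5$ and $\varrho_4 = \frac{7}{16}$ in (LP). The dual of (LP) is,

\[
\begin{array}{lll}
\min & \displaystyle \sum_{i=3}^{k} \beta_i & \\
\text{s.t.} & (1)\ \ \delta_3 + \delta_4 + \delta_5 + \delta_6 \ge \frac{1}{3}, & \\
 & (2)\ \ \beta_3 + \gamma_4 - \delta_3 \ge \frac{5}{12}, & \\
 & (3)\ \ \beta_3 + 2\gamma_4 \ge \frac{1}{2}, & \\
 & (4)\ \ \beta_3 + 3\gamma_4 \ge \frac{7}{12}, & \\
 & (5)\ \ \beta_i + \gamma_{i+1} - \gamma_i - \delta_i \ge \frac{1}{i(i+1)}, & i = 4, 5, 6, \\
 & (6)\ \ \beta_i + j\gamma_{i+1} - j\gamma_i \ge \frac{j}{i(i+1)}, & i = 4, \ldots, k-1,\ j = 1, \ldots, i-1, \\
 & (7)\ \ \beta_i + i\gamma_{i+1} - 2.25\gamma_i \ge \frac{1}{i+1}, & i = 4, \\
 & (8)\ \ \beta_i + i\gamma_{i+1} - (i - 2 + \epsilon)\gamma_i \ge \frac{1}{i+1}, & i = 5, \ldots, k-1, \\
 & (9)\ \ \beta_k - j\gamma_k \ge \frac{j}{k}, & j = 1, \ldots, k-1,\ k \ge 7, \\
 & (9.1)\ \ \beta_k - \gamma_k - \delta_k \ge \frac{1}{k}, & k = 4, 5, 6, \\
 & (10)\ \ \beta_k - (k - 2 + \epsilon)\gamma_k \ge 1, & \\
 & (11)\ \ \beta_i \ge 0, & i = 3, \ldots, k, \\
 & (12)\ \ \gamma_i \ge 0, & i = 4, \ldots, k, \\
 & (13)\ \ \delta_i \ge 0, & i = 3, 4, 5, 6.
\end{array} \tag{Dual}
\]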
For $k = 4$, set $\gamma_4 = \frac{1}{12}$, $\delta_3 = 0$, $\delta_4 = \frac{1}{3}$, $\beta_3 = \frac{1}{3}$, $\beta_4 = 1 + \frac{9}{4} \cdot \frac{1}{12}$. We have

\[ \sum_{i=3}^{4} \beta_i = \frac{7}{16} + \frac{1}{12} + 1. \]
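As a sanity check of the $k = 4$ case, one can verify dual feasibility with exact rational arithmetic. This is a sketch under stated assumptions: the dual values are taken to be $\gamma_4 = \frac{1}{12}$, $\delta_3 = 0$, $\delta_4 = \frac{1}{3}$, $\beta_3 = \frac{1}{3}$, $\beta_4 = 1 + \frac{9}{4} \cdot \frac{1}{12}$, and the constraint list is derived here by dualizing (LP) for $k = 4$ with $\varrho_4 = \frac{7}{16}$, one constraint per primal variable; it is not copied from the paper's numbered list.

```python
from fractions import Fraction as F

# Assumed dual values for k = 4.
gamma4, delta3, delta4 = F(1, 12), F(0), F(1, 3)
beta3 = F(1, 3)
beta4 = 1 + F(9, 4) * F(1, 12)

# (left-hand side, right-hand side) pairs, one per primal variable of (LP) with k = 4.
constraints = [
    (delta3 + delta4, F(1, 3)),                      # a_1
    (beta3 + gamma4 - delta3, F(5, 12)),             # b_{3,1}
    (beta3 + 2 * gamma4, F(1, 2)),                   # b_{3,2}
    (beta3 + 3 * gamma4, F(7, 12)),                  # b_{3,3}
    (beta4 - gamma4 - delta4, F(1, 4)),              # b_{4,1}
    (beta4 - 2 * gamma4, F(2, 4)),                   # b_{4,2}
    (beta4 - 3 * gamma4, F(3, 4)),                   # b_{4,3}
    (beta4 - 4 * (1 - F(7, 16)) * gamma4, F(1)),     # b_{4,4}, coefficient 2.25
]

feasible = all(lhs >= rhs for lhs, rhs in constraints)
dual_value = beta3 + beta4
print(feasible, dual_value, float(dual_value))
```

Under these assumptions the constraints for $b_{3,1}$, $b_{3,2}$, $b_{3,3}$ and $b_{4,4}$ are tight, and the dual value matches the ratio 1.5208 quoted in the abstract for $k = 4$.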



For fc = 5, set 74 = ^, 75 = 0, ft = i, ft = ^ + 374, ft = 1, £3 = 0, 84 = 
± + 2 74 , S 5 = i. We have E- =3 ft = § + | + 1. 

For k > 6. 

Set <5 3 = 0; 74 = ^, 7, = 7 J+2 + i(t+1 ) (i+2) for i = 6, fc - 2, 7^-1 = 0, 
7*= = (fc-i)fc ' ft = g, ft = j^i ~ iji+i + (i - 2 + e)7 4 for i = 6,...,fc - 1, 
ft = l + (fc-2 + e) 7fc . 

' i(i+l)(i+2) ~~ i(»+l) = 7i+l +7j+2 - 

(i+iW+aV By induction ' 7i + 7m - i(iTT) = T*-i + 7fe - (fc^: - 0. Hence, 

7» +7»+i = ./■ ^ tt , fori = 6,...,fc-l. 
1(1 + 1) 

Then, constraints (2), (3) and (4) hold as equality. 

In constraint (6) for i > 6, j < i - 1, ft + J7 i+ i - j'7; = ^ - i7 i+1 + (i - 
2 + e)7i + J7*+i ~ hi = 7^1 - (i ~ 3hi+i +(i-2-j + e)( -^y - 7i+i) = 

_J_/-| , i-j-2+e -, _ /„. _ 2 _ 2 • , •> > J_ , i-2-j+e _ 2j-2-2j+e _ _J_ 

In constraint (8) for z > 6, inequalities hold as equality. 

Constraint (9) holds, ft - j-y k = 1 + (k - 2)^ k - > 1 - -y k > |. 

Constraint (10) holds as equality. 

Set 75 = ^ - 7 6 , <5 4 = 274 - 275 + ±, 5 5 = 275 - 476 + ^, <5 6 = for odd A; 
and <5 6 = Y5 f° r even k. 

ft > max{i + 1 74 - 475, ^ + 74 - 75 + 54, |) + 374 - 375} = ^ + 374 - 375, 
ft > max{i + (3+e)75 - 57 6 , 30+75-76+55, ^+475-47 6 } = g + (3+e)7 5 - 57 6 . 
Let ft = A + 3 74 - 3 75 , ft = i + (3 + e) 75 - 5 76 . 

For k is odd and k > 9, 75 = 75 - 7s H h 7fc-3 - 7fc-i + 7fc-i = 2 (e^8 + 

• • • + ( fc _ 3 )( fc l 2 )(fc-i) ) i ncreases monotonically with respect to fc. Thus, for any odd fc 
and fc > 9, 76 = 2H h _ 1 - 2H k _ 2 + ^ + f < -2 ln2 + f < ^. 
For fc = 7, 76 = 0. 

Hence, Constraint (1) holds, <5 3 + <5 4 + 5 5 + <5 6 = 274 + ^ + ^-476 = |-47 6 > \ . 
For k is even and k > 6, -f 6 = -f 6 - 7 8 H h jk-2 - Ik + Ik = 2( H h 

( fc _2)( 1 fc_i)fc ) + (fc-i)fc = 2/f* L -2 - 2H k -i + -j^j + I decreases monotonically with 
respect to k. For k = 6, — 35 • Thus, for any even and k > 6, 76 < ^j. Moreover, 
76 >-21n2 + |>i 

Hence, Constraint (1) holds, + £4 + S 5 + 5 6 = 274 + j$ + 30 ~ ^76 + T5 = 

Constraint (5) for i = 4,5, constraint (6) for i = 4,5, constraint (7) and constraint 
(8) for i = 5 hold directly as a result of the settings of these parameters. 

Constraint (5) for i = 6 holds, /3 6 + 77 — je — Sq — = j + 376 — 577 — jg- = 
i + 3 76 - 5(£ - 76) ~ " 31 = 8 7 6 - > «| - & > 0. 

Constraint (9.1) holds for fc = 6, — 76 — ^6 > 1 + 37e — jg > 33. 

Moreover, constraint (1 1), (12), (13) hold. 

Finally, we compute the value of the objective function. 

For odd k and k > 7, £*L 3 ft = A» + ft + As + Ei= 6 A = 3 + 5ff + I + 3 74 + 
£75 - 576 + Ei=6 i4tr-ni+i + («-2 + e)7i + l + (fc-2 + e)7 fc = i + ^ + i + 374- 

57 6 + l+I-7 7 +476+Et8 T-(s 1 9 + To 1 TT + - ' •+(^)+^Et 5 7, = l+t+f + 
| + f- 5 76-(^-76)+4 7 6 + 2(i + ---i) + 6Et5 7 J <l + H! + f + --- + l+ e - 

Last inequality holds because E,=5 7* < Ei=s (TTiTI < §■ 

Similarly, for even fc and k > 6, £*L 3 A = & + At + ft +E;=6 A = § + 55 + S + 
374+£75 - 576+Ei=6 Tfi-*7i+i+(i-2+c)7i+l+(A-2+c)7fc = 3 + M + l+ 3 74- 
576+l + T -77+476+Ets T-(s 1 9+To 1 TT + - ' '+ (fc-2Kfc-i) + (FITfc)+ e Ets 7* = 
1 + 5 + ! + H7- 5 76-(4^-76) + 476 + 2(i + --- 7 ^3) + T ^ T + | + eEt 5 7,< 
1 + l + ! + f + -" + I 1 3 + fe ± T + f + e - 

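Therefore, the approximation ratio of PRPSLI for odd $k$ and $k \ge 7$ can be upper
bounded by $1 + \frac{1}{3} + \frac{2}{5} + \cdots + \frac{2}{k} + \epsilon$. For even $k$ and $k \ge 6$, it is upper bounded
by $1 + \frac{1}{3} + \frac{2}{5} + \cdots + \frac{2}{k-3} + \frac{1}{k-1} + \frac{2}{k} + \epsilon$. □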
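The bounds obtained in this proof can be evaluated exactly and compared with the harmonic number $H_k$. The sketch below assumes the closed forms $1 + \frac{1}{3} + \frac{2}{5} + \cdots + \frac{2}{k}$ for odd $k$ and $1 + \frac{1}{3} + \frac{2}{5} + \cdots + \frac{2}{k-3} + \frac{1}{k-1} + \frac{2}{k}$ for even $k \ge 6$, together with $\frac{7}{16} + \frac{1}{12} + 1$ for $k = 4$, all with the $\epsilon$ term dropped; the function names are hypothetical.

```python
from fractions import Fraction as F


def harmonic(k):
    """The k-th harmonic number H_k as an exact fraction."""
    return sum(F(1, i) for i in range(1, k + 1))


def prpsli_bound(k):
    """Upper bound on the approximation ratio for k >= 4, epsilon dropped."""
    if k == 4:
        return F(7, 16) + F(1, 12) + 1
    base = 1 + F(1, 3)
    if k % 2 == 1:
        # odd k: 1 + 1/3 + 2/5 + 2/7 + ... + 2/k
        return base + sum(F(2, j) for j in range(5, k + 1, 2))
    # even k: 1 + 1/3 + 2/5 + ... + 2/(k-3) + 1/(k-1) + 2/k
    return base + sum(F(2, j) for j in range(5, k - 2, 2)) + F(1, k - 1) + F(2, k)


for k in (4, 5, 6, 7, 8):
    gap = harmonic(k) - prpsli_bound(k)
    print(k, float(prpsli_bound(k)), float(gap))
```

For small $k$ this reproduces the values 1.5208, 1.7333 and 1.8667 stated in the abstract, and for large $k$ the gap $H_k - \text{bound}$ approaches $\frac{4}{3} - \ln 2 \approx 0.6402$.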

9.3 Tight example of PRPSLI 

For every $k \ge 4$ and any $\epsilon > 0$, we give a tight example of PRPSLI based on the tight
example of the semi-local (2,1)-improvement [2] for 3-Set Cover, the tight example of
the Restricted 4-Set Packing algorithm we give in Appendix Section 7.2, and the tight
example of the Restricted $k$-Set Packing algorithm for $k \ge 5$, which is the same as the
tight example of the $k$-set packing heuristic [7].
We assume that the optimal solution $\mathcal{O}$ consists of only disjoint $k$-sets. To calculate
the performance ratio on this instance, we charge a cost of 1 for each set chosen by the
algorithm, and the cost is uniformly distributed to every element of the chosen set.

- $k = 4$. In Phase 4, the Restricted 4-Set Packing algorithm covers 1 element of
each set in a $\frac{1}{4}(1 - \epsilon)$ fraction of $\mathcal{O}$, 2 elements of each set in a $\frac{3}{4}(1 - \epsilon)$ fraction,
and 3 elements of each set in the remaining $\epsilon$ fraction. Denote the three parts
of $\mathcal{O}$ by $\mathcal{O}_1$, $\mathcal{O}_2$ and $\mathcal{O}_3$ respectively. In Phase 3, the semi-local optimization covers
1 element in each set of $\mathcal{O}_1$ by 3-sets, and the remaining uncovered elements
of $\mathcal{O}$ are covered by 2-sets. The performance ratio of PRPSLI on this instance is

\[ \frac{3}{4}(1 - \epsilon)\Big(\frac{1}{2} + 1\Big) + \frac{1}{4}(1 - \epsilon)\Big(\frac{1}{4} + \frac{1}{3} + 1\Big) + \epsilon\Big(\frac{3}{4} + \frac{1}{2}\Big) = \frac{7}{16} + \frac{1}{12} + 1 - \frac{13}{48}\epsilon. \]

- $k = 5$. In Phase 5, the Restricted 5-Set Packing algorithm covers 2 elements of
each set in a $1 - \epsilon$ fraction of $\mathcal{O}$, and 1 element of each set in the remaining $\epsilon$ fraction.
Denote the two parts of $\mathcal{O}$ by $\mathcal{O}_2$ and $\mathcal{O}_1$ respectively. The algorithm switches to
4-Set Cover on $\mathcal{O}_1$ and it performs a semi-local optimization on $\mathcal{O}_2$. The performance
ratio of Algorithm 1 on this instance is

\[ (1 - \epsilon)\Big(\frac{2}{5} + \frac{1}{3} + 1\Big) + \epsilon\Big(\frac{1}{5} + \frac{7}{16} + \frac{1}{12} + 1\Big) = \frac{2}{5} + \frac{1}{3} + 1 - \frac{1}{80}\epsilon. \]

- $k$ odd and $k \ge 7$. In Phase $k$, the $k$-Set Packing algorithm covers 2 elements of
each set in a $1 - \epsilon$ fraction of $\mathcal{O}$, and 1 element of each set in the remaining $\epsilon$ fraction.
Denote the two parts of $\mathcal{O}$ by $\mathcal{O}_2$ and $\mathcal{O}_1$ respectively. In Phase $k - 1$, the algorithm
covers 1 element of each set in $\mathcal{O}_1$. Then it switches to $(k - 2)$-Set Cover on the
remaining uncovered elements. The performance ratio of PRPSLI on this instance
is at least (1 - e)§ + e(± + ^) + p fe _ 2 , i.e. f + pu-2 + (j^j ~ |)e, which is 

f + l+ 3" + 1 + lMo e for k = 7 > and b y induction f + ^H h f + ^ + 1 + 

( ih I + ih ih + ■ ■ ■ + I - 5 + lio ) £ for k > 9 - The coefficient of e is 

upperboundedby^-i + ^-^ + .-. + i-il^^ = hl + ljk ^ L 
Hence, the performance ratio is at most -| + + -- - + | + -| + l + e for k > 9. 

- $k$ even and $k \ge 6$. In Phase $k$, the $k$-Set Packing algorithm covers 2 elements of
each set in a $1 - \epsilon$ fraction of $\mathcal{O}$, and 1 element of each set in the remaining $\epsilon$ fraction.
Denote the two parts of $\mathcal{O}$ by $\mathcal{O}_2$ and $\mathcal{O}_1$ respectively. In Phase $k - 1$, the $(k - 1)$-Set
Packing algorithm covers 1 element of each set in $\mathcal{O}_1$ and then 1 element of
each set in $\mathcal{O}$. Then the algorithm switches to $(k - 3)$-Set Cover on the remaining
uncovered elements. The performance ratio of Algorithm 1 on this instance is at
least (1 - e)f + e(± + ^) + ^ + p,_ 3 , i.e. | + ^ + p,_ 3 + - i)e, 
which is| + i + i + l + -^e for fc = 6 and by a similar argument as the above 
case, at most | + + H h| + | + l + efor/c>8. 
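The charging argument in the cases above can be reproduced with exact fractions for $k = 4$ and $k = 5$. This is a sketch under assumptions: for $k = 4$ it takes the split of the optimal sets to be $\frac{3}{4}(1-\epsilon)$ with two elements covered in Phase 4, $\frac{1}{4}(1-\epsilon)$ with one element covered, and $\epsilon$ with three elements covered, which is the split that yields $\frac{7}{16} + \frac{1}{12} + 1$; for $k = 5$ it charges the $\epsilon$ part at the limiting 4-Set Cover ratio. Function and variable names are hypothetical.

```python
from fractions import Fraction as F


def ratio_k4(eps):
    """Average cost per optimal 4-set; each chosen set costs 1, split evenly over its elements."""
    two_covered = F(2, 4) + 2 * F(1, 2)              # two elements in 4-sets, two in 2-sets
    one_covered = F(1, 4) + F(1, 3) + 2 * F(1, 2)    # one in a 4-set, one in a 3-set, two in 2-sets
    three_covered = F(3, 4) + F(1, 2)                # three in 4-sets, one in a 2-set
    return (F(3, 4) * (1 - eps) * two_covered
            + F(1, 4) * (1 - eps) * one_covered
            + eps * three_covered)


def ratio_k5(eps):
    """Average cost per optimal 5-set under the same charging scheme."""
    two_covered = F(2, 5) + F(1, 3) + 1                # two in 5-sets, then semi-local optimization
    one_covered = F(1, 5) + F(7, 16) + F(1, 12) + 1    # one in a 5-set, then 4-Set Cover
    return (1 - eps) * two_covered + eps * one_covered


eps = F(1, 100)
print(ratio_k4(eps), ratio_k5(eps))
```

With these assumptions the $\epsilon$ coefficients come out as $-\frac{13}{48}$ for $k = 4$ and $-\frac{1}{80}$ for $k = 5$, so both ratios tend to the upper bounds as $\epsilon \to 0$.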



